EEMD-Based Speaker Automatic Emotional Recognition in Chinese Mandarin
Abstract
Emotion feature extraction is the key to speech emotion recognition. Ensemble empirical mode decomposition (EEMD) is a recently developed method aimed at eliminating the mode mixing present in the original empirical mode decomposition (EMD). To evaluate the performance of this method, this paper investigates the effect of a parameter pertinent to EEMD: the speech emotional envelope. First, a method for extracting speaker emotional envelope features based on EEMD is proposed. Applying a piecewise power function to the speech emotional envelope improves emotion identification. The proposed technique is then used to classify four kinds of emotional speech signals (angry, happy, sad, and neutral). Emotional intrinsic mode functions (IMFe) are obtained by applying empirical mode decomposition to the emotional speech signals; the fast Fourier transform (FFT) of each intrinsic mode function is extracted as the emotional feature coefficient, which is then used for speaker emotion identification by vector quantization. MATLAB is used to compute the characteristics of the emotional speech signals using ensemble empirical mode decomposition (EEMD). We obtain an emotional envelope by transforming the IMFe of the emotional speech signals, and derive a new method of emotion recognition based on the different emotional envelope feature vectors. The results indicate that the proposed method works well for speaker emotion identification.
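The feature and classification stage described in the abstract (FFT of each intrinsic mode function, followed by vector quantization) can be illustrated with a minimal sketch. It is written in Python with NumPy rather than the MATLAB used in the paper, and it assumes the IMFs have already been computed by some EEMD routine. The function names, the band averaging, the unit-norm scaling, and the toy sinusoidal "IMFs" are all hypothetical choices made for illustration, not the authors' implementation.

```python
import numpy as np

def imf_fft_features(imfs, n_bins=8):
    """Summarise each IMF by its FFT magnitude averaged into n_bins bands.

    Band averaging and unit-norm scaling are illustrative assumptions.
    """
    imfs = np.atleast_2d(np.asarray(imfs, dtype=float))
    spectra = np.abs(np.fft.rfft(imfs, axis=1))        # (n_imfs, n_freqs)
    bands = np.array_split(spectra, n_bins, axis=1)    # contiguous frequency bands
    feats = np.stack([b.mean(axis=1) for b in bands], axis=1)
    vec = feats.ravel()                                # one vector per utterance
    norm = np.linalg.norm(vec)
    return vec / norm if norm > 0 else vec

def vq_distortion(features, codebook):
    """Mean squared distance from each feature vector to its nearest codeword."""
    features = np.atleast_2d(features)
    diffs = features[:, None, :] - codebook[None, :, :]
    dists = (diffs ** 2).sum(axis=2)                   # (n_vectors, n_codewords)
    return dists.min(axis=1).mean()

def classify(features, codebooks):
    """Return the label whose codebook gives the lowest VQ distortion."""
    return min(codebooks, key=lambda label: vq_distortion(features, codebooks[label]))

# Toy data: two sinusoids stand in for the IMFs of an utterance (hypothetical).
t = np.arange(256) / 256.0

def toy_imfs(f_hi, f_lo):
    return np.stack([np.sin(2 * np.pi * f_hi * t), np.sin(2 * np.pi * f_lo * t)])

# One-codeword codebooks per emotion; real codebooks would be trained (e.g. k-means).
codebooks = {
    "angry": imf_fft_features(toy_imfs(60, 20))[None, :],
    "sad": imf_fft_features(toy_imfs(10, 3))[None, :],
}
query = imf_fft_features(toy_imfs(58, 19))
print(classify(query, codebooks))
```

In practice each emotion's codebook would hold several codewords learned from training utterances, and the envelope transformation with the piecewise power function would be applied before classification; neither step is specified in enough detail in the abstract to reproduce here.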
Related resources
Automatic speech recognition in Mandarin for embedded platforms
In this paper, we describe a real-time automatic speech recognition system for Mandarin for low-cost embedded platforms using fixed-point digital signal processors. The hands-free, speaker-independent speech recognition system employs 41 mono-phone models for representing the sounds in Mandarin Chinese and 11 whole-word models for connected digit recognition. The system achieves greater than 98...
Real-time rich-content transcription of Chinese broadcast news
This paper describes the recent development of an Audio Indexing System for Chinese (Mandarin) broadcast news. Key issues of the three major components: automatic speech recognition, speaker identification and named entity extraction are addressed. The Chinese-language-specific challenges are discussed and our solutions are described. The recognition accuracy of the final system is comparable t...
Incorporating Pitch Features for Tone Modeling in Automatic Recognition of Mandarin Chinese
Tone plays a fundamental role in Mandarin Chinese, as it plays a lexical role in determining the meanings of words in spoken Mandarin. For example, these two sentences R R (I like horses) and R M (I like to scold) differ only in the tone carried by the last syllable. Thus, the inclusion of tone-related information through analysis of pitch data should improve the performance of automatic speech...
iCALL corpus: Mandarin Chinese spoken by non-native speakers of European descent
We present iCALL, a speech corpus designed to evaluate Mandarin Chinese pronunciation patterns of non-native speakers of European descent, developed at the Institute for Infocomm Research (I²R) in Singapore. To the best of our knowledge, iCALL is larger than any reported non-native corpora to date in terms of utterance number, duration, and number of speakers: iCALL consists of 90,841 utterances...
Regression class selection and speaker adaptation with MLLR in Mandarin continuous speech recognition
Currently, CDHMM-based continuous speech recognition has been widely extended to speaker-independent (SI) systems. However, the performance of an SI system is highly dependent on the speakers; especially for accented Mandarin speech, speaker adaptation becomes crucially important for real applications. In this paper, the MLLR approach is studied for speaker adaptation in Mandarin continuous speech ...